696 research outputs found
On the Use of Perceptual Properties for Melody Estimation
cote interne IRCAM: Liao11aInternational audienceThis paper is about the use of perceptual principles for melody estimation. The melody stream is understood as generated by the most dominant source. Since the source with the strongest energy may not be perceptually the most dominant one, it is proposed to study the perceptual properties for melody estimation: loudness, masking effect and timbre similarity. The related criteria are integrated into a melody estimation system and their respective contributions are evaluated. The effectiveness of these perceptual criteria is confirmed by the evaluation results using more than one hundred excerpts of music recordings
Automatic Piano Transcription with Hierarchical Frequency-Time Transformer
Taking long-term spectral and temporal dependencies into account is essential
for automatic piano transcription. This is especially helpful when determining
the precise onset and offset for each note in the polyphonic piano content. In
this case, we may rely on the capability of self-attention mechanism in
Transformers to capture these long-term dependencies in the frequency and time
axes. In this work, we propose hFT-Transformer, which is an automatic music
transcription method that uses a two-level hierarchical frequency-time
Transformer architecture. The first hierarchy includes a convolutional block in
the time axis, a Transformer encoder in the frequency axis, and a Transformer
decoder that converts the dimension in the frequency axis. The output is then
fed into the second hierarchy which consists of another Transformer encoder in
the time axis. We evaluated our method with the widely used MAPS and MAESTRO
v3.0.0 datasets, and it demonstrated state-of-the-art performance on all the
F1-scores of the metrics among Frame, Note, Note with Offset, and Note with
Offset and Velocity estimations.Comment: 8 pages, 6 figures, to be published in ISMIR202
Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio Effects
We propose an end-to-end music mixing style transfer system that converts the
mixing style of an input multitrack to that of a reference song. This is
achieved with an encoder pre-trained with a contrastive objective to extract
only audio effects related information from a reference music recording. All
our models are trained in a self-supervised manner from an already-processed
wet multitrack dataset with an effective data preprocessing method that
alleviates the data scarcity of obtaining unprocessed dry data. We analyze the
proposed encoder for the disentanglement capability of audio effects and also
validate its performance for mixing style transfer through both objective and
subjective evaluations. From the results, we show the proposed system not only
converts the mixing style of multitrack audio close to a reference but is also
robust with mixture-wise style transfer upon using a music source separation
model
VRDMG: Vocal Restoration via Diffusion Posterior Sampling with Multiple Guidance
Restoring degraded music signals is essential to enhance audio quality for
downstream music manipulation. Recent diffusion-based music restoration methods
have demonstrated impressive performance, and among them, diffusion posterior
sampling (DPS) stands out given its intrinsic properties, making it versatile
across various restoration tasks. In this paper, we identify that there are
potential issues which will degrade current DPS-based methods' performance and
introduce the way to mitigate the issues inspired by diverse diffusion guidance
techniques including the RePaint (RP) strategy and the Pseudoinverse-Guided
Diffusion Models (GDM). We demonstrate our methods for the vocal
declipping and bandwidth extension tasks under various levels of distortion and
cutoff frequency, respectively. In both tasks, our methods outperform the
current DPS-based music restoration benchmarks. We refer to
\url{http://carlosholivan.github.io/demos/audio-restoration-2023.html} for
examples of the restored audio samples
Bacteremic pneumonia caused by Nocardia veterana in an HIV-infected patient
SummaryDisseminated Nocardia veterana infection has rarely been reported. We describe the first reported case of N. veterana bacteremic pneumonia in an HIV-infected patient. The isolate was confirmed by 16S rRNA sequencing analysis. The patient initially responded well to trimethoprim–sulfamethoxazole treatment (minimum inhibitory concentration 0.25μg/ml), but died of ventilator-associated pneumonia
Kinematic Analyses of a Parallel-type Independently Controllable Transmission
This study proposes a novel design of a parallel-type Independently Controllable Transmission (ICT). The parallel-type ICT can produce a continuously variable transmission ratio and a required angular output velocity that can be independently manipulated by a controller yet not affected by the angular velocity of the input shaft. The proposed parallel-type ICT is composed of two planetary gear trains and two transmission-connecting members. A prototype was built to investigate its kinematic characteristics and verify application feasibility
Ample Pairs
We show that the ample degree of a stable theory with trivial forking is
preserved when we consider the corresponding theory of belles paires, if it
exists. This result also applies to the theory of -structures of a trivial
theory of rank .Comment: Research partially supported by the program MTM2014-59178-P. The
second author conducted research with support of the programme
ANR-13-BS01-0006 Valcomo. The third author would like to thank the European
Research Council grant 33882
- …